In this project, I developed a high availability and disaster recovery plan for Progreso Ltd., a finance-based company recently impacted by data loss due to a fire outbreak. I began by analyzing the company’s infrastructure requirements and proposed a resilient architecture using Hyper-V clustering. I configured two Hyper-V servers as a failover cluster, enabling seamless failover and failback operations. This setup ensures the continuous availability of critical virtual machines and services, even if one server fails, reducing downtime and ensuring business continuity.
To enhance data resilience and performance, I implemented Storage Spaces Direct (S2D) to pool storage resources across servers and create a fault-tolerant, high-speed storage solution. I also laid out a robust networking infrastructure with load balancing and redundancy to support optimal communication between the clustered nodes and storage systems. To further strengthen the setup, I recommended using Hyper-V Live Migration for performing monthly system updates without downtime, and Storage Migration Service to upgrade legacy storage systems to the S2D environment without disrupting operations.
For disaster recovery, I incorporated Azure Site Recovery to replicate virtual machines to the cloud, allowing Progreso to swiftly failover in case of any local disaster. I proposed cost optimization strategies, including hardware selection, energy-saving policies, and leveraging cloud services for specific functions like backup. I also addressed comprehensive security considerations such as network segmentation, server hardening, data encryption, access control, cloud security using Azure Security Centre, and ongoing monitoring through SIEM tools. My solution provides Progreso with a secure, scalable, and highly available infrastructure that ensures operational continuity and positions the company for sustainable growth.